Extraction of user preferences from a few positive documents
نویسندگان
چکیده
In this work, we propose a new method for extracting user preferences from a few documents that might interest users. For this end, we first extract candidate terms and choose a number of terms called initial representative keywords (IRKs) from them through fuzzy inference. Then, by expanding IRKs and reweighting them using term co-occurrence similarity, the final representative keywords are extracted. Performance of our approach is heavily influenced by effectiveness of selection method for IRKs so we choose fuzzy inference because it is more effective in handling the uncertainty inherent in selecting representative keywords of documents. The problem addressed in this paper can be viewed as the one of finding a representative vector of documents in the linear text classification literature. So, to show the usefulness of our approach, we compare it with two famous methods Rocchio and Widrow-Hoff on the Reuters-21578 collection. The results show that our approach outperforms the other approaches.
منابع مشابه
A social recommender system based on matrix factorization considering dynamics of user preferences
With the expansion of social networks, the use of recommender systems in these networks has attracted considerable attention. Recommender systems have become an important tool for alleviating the information that overload problem of users by providing personalized recommendations to a user who might like based on past preferences or observed behavior about one or various items. In these systems...
متن کاملAutomatic Extraction of Semantic Preferences from Multimedia Documents
One of the most important topics in modern multimedia research is the treatment of documents and users at a semantic level. In this framework, the automated extraction of semantic preferences from multimedia content is an important problem. This paper is part of our ongoing work in the field of semantic multimedia analysis and retrieval; it extends on previous work on scene and shot detection, ...
متن کاملExtraction of Representative Keywords Considering Co-occurrence in Positive Documents
In linear text classification, user feedback is usually used to tune up the representative keywords (RK) for a certain class. Despite some algorithms (e.g. Rocchio) deal well with user positive and negative feedback to adjust the RKs, few researches have investigated how to adjust RKs only based on a small positive responses which is a popular case in the real-world application (e.g. users tend...
متن کاملAssessment of user preferences of campus green space at Ferdowsi University of Mashhad-Iran
Researchers have found that a user’s perception of the campus environment is related to quality life and academic accomplishment. In this study, we have analyzed the perceptions of more than 600 users at the Ferdowsi University of Mashhad to evaluate the level of green space use and to understand user preferences from aesthetics and safety aspects. The results show that for most of the responde...
متن کاملFeature extraction in opinion mining through Persian reviews
Opinion mining deals with an analysis of user reviews for extracting their opinions, sentiments and demands in a specific area, which can play an important role in making major decisions in such area. In general, opinion mining extracts user reviews at three levels of document, sentence and feature. Opinion mining at the feature level is taken into consideration more than the other two levels d...
متن کامل